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DETAILED ACTION 

1 . Applicant has amended claims 1 , 11, 17, 27 and 34 in the amendment filed on 
1/17/2006. 

Claims 1-38 are pending in this Office Action. 

Response to Arguments 

2. Applicant's arguments with respect to claims 1-38 have been considered but are 
moot in view of the new ground(s) of rejection. 

Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

4. Claims 1, 2, 3, 5, 6, 10, 17-24, 26, 34, 37-38 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Madnick in view of lizuka et al (or hereinafter "lizuka") (US 
6424980) and Bates et al (or hereinafter "Bates") (US 6873982). 

As to claim 1 , Madnick teaches a method for extracting data from a network by a 
server (col. 3, lines 1-7; col. 1-2); 

"enabling a database-structured query with at least one fundamental clause to be 
generated by a user" as the request translator receives a data request from data 
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receiver 102 and translates the data request into a query at the wrapper generator 614. 
The converter query converts a least a portion of the query into a command to interact 
with a semi-structured data sources such as HTML documents, flat files containing data 
that are not arranged as a relational database. The above information shows that a 
command is created based on the data request. The data request from data receiver 
102 is represented as a user input. The command is represented as the database- 
structured query that is not generated by a user. The wrapper generator 614 is 
represented as a server (col. 2, lines 46-55, col. 2, lines 30-33); 

"determining a web domain address on the network from which to extract the 
data" as determining a URL on the network to extract the data (table 2, col. 12, lines 1- 
10, lines 1-5); 

"extracting the data from the web domain address directly by retrieving a non- 
database structured arrangement of data from the determined web domain address and 
performing the database-structured query upon the retrieved non-database structured 
arrangement of data" as at least a portion of the query is converted into one or more 
commands which can be used to interact with a semi-structured data source. Those 
commands are issued and data is extracted from the data source. In this case a 
source is located at an address or URL. The above information shows that the data is 
extracted from a semi-structured data source based on the address of the source and 
the command (fig. 7, col. 10, lines 25-32; col. 2, lines 2-8). 
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Madnick does not explicitly teach the claimed limitation "with at least one 
fundamental clause to be generated by a user; providing extracted data in data log 
directly to the user 1 '. 

lizuka teaches the user interface unit receives a search request (query 
statement) consisting of search items and search condition (col. 13, lines 35-40). 

Bates teaches storing extracted documents as results in result cache and 
returning a first document as a first result from the result cache directly to a user (fig. 10, 
col. 1 1 , lines 60-67; col. 12, lines 1-27). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply lizuka's teaching user interface unit receives a search 
request (query statement) consisting of search items and search condition and Bates 
teaches storing extracted documents as results in result cache and returning a first 
document as a first result from the result cache directly to a user to Madnick's system in 
order to provide a result to a user quickly after retrieving data from a plurality of semi- 
structured document via open network without need a lot of time and labor to design 
and manage. 

As to claim 2, Madnick teaches the claimed limitation "wherein creating the 
database-structured query further comprises, including a network address within the 
database-structured query indicating a starting point" as creating a command after 
converting at least a portion of a query, the command includes a network address as 
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URL: http://quotes.qalt.com/ . This URL is indicated as a starting point (Table 2, col. 7, 
lines 25-32; col. 2, lines 5-10). 

As to claim 3, Madnick teaches the claimed limitation "wherein the determined 
web domain address, includes at least one universal resource locator (URL)" as the 
URL (col. 12, lines 5-10, table 2). 

As to claim 5, Madnick teaches the claimed limitation "wherein creating the 
database-structured query, further comprises, creating a regular expression within the 
database-structured query used to determine the data to extract" creating regular 
expression with a specification file 706 as a command to determine the data to extract 
(col. 10, lines 2-5; col. 12, lines 5-10, table 2). 

As to claim 6, Madnick teaches the claimed limitation "wherein directly extracting 
data from the web domain, further comprises, matching a plurality of patterns contained 
within the regular expression to the retrieved data to determine the data to extract" as 
each variable to be retrieved in a given state, the state description contains a pattern to 
be matched against the document or semi-structured data source. The above 
information shows that matching each pattern of each variable contained with the 
regular expression (col. 15, lines 1-10). 
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As to claim 10, Madnick teaches the claimed limitation "reshaping at least a 
portion of the extracted data for use by at least one data analysis software program" as 
extracted data is translated by the data translator from the data context of the data 
source into the data context associated with the initial request. It means that the 
extracted data is reshaped by translating. The above information shows that the system 
has included a data analysis software program to translate the extracted data (col. 3, 
lines 6-8). 

As to claim 17, Madnick teaches the claimed limitations: 

"a client computer system having a client network connection to the network and 
communicating with a server computer system" as (col. 3, lines 60-67; col. 4, lines 1-5); 

"the server computer system having a server network connection to the network 
and communicating with the client computer system" as (col. 3, lines 60-67; col. 4, lines 
1-5), "the server computer system further configured to perform actions, comprising: 

receiving the database-structured query from the client computer system as the 
request translator receives a data request from data receiver 102 and translates the 
data request into a query. The converter query converts a least a portion of the query 
into a command to interact with a semi-structured data sources such as HTML 
documents, flat files containing data that are not arranged as a relational database. The 
above information shows that a command is created based on the data request. The 
data request from data receiver 102 is represented as a user input. The command is 
represented as the database-structured query (col. 2, lines 46-55, col. 2, lines 30-33); 
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"determining a web domain address on the network from which to extract at least 
a portion of the data relevant to the query, wherein the determined web domain address 
is provided by the database-structured query" (fig. 7, col. 10, lines 25-32; col. 2, lines 2- 
8); 

"extracting at least the portion of the data from the web domain address directly 
by retrieving a non-database structured arrangement of data from the determined web 
domain address and performing the database-structured query upon the retrieved non- 
database structured arrangement of data" as (fig. 7, col. 10, lines 25-32; col. 2, lines 2- 
8). 

Madnick does not explicitly teach the claimed limitation "the client creating a 
database-structured query with at least one fundamental clause, based, in part, on a 
user input; providing extracted data directly to the user". 

lizuka teaches client 100 creating a database structured query based, in part, on 
a user input (fig. 39, col. 13, lines 35-45). 

Bates teaches storing extracted documents as results in result cache and 
returning a first document as a first result from the result cache directly to a user (fig. 10, 
col. 11, lines 60-67; col. 12, lines 1-27). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply lizuka's teaching of client 100 creating a database 
structured query based, in part, on a user input and Bates teaches storing extracted 
documents as results in result cache and returning a first document as a first result from 
the result cache directly to a user to Madnick's system in order to provide a result to a 
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user quickly after retrieving data from a plurality of semi-structured document via open 
network without need a lot of time and labor to design and manage. 

As to claim 18, Madnick teaches the claimed limitation "wherein the database- 
structured query, further comprises, a network address within the database-structured 
query indicating a starting point" as (table 2, col. 12, lines 5-10). 

As to claim 19, Madnick teaches the claimed limitation "a regular expression 
within the database-structured query used to determine the data to extract" as (col. 10, 
lines 2-5; col. 12, lines 5-10, table 2). 

As to claim 20, Madnick teaches the claimed limitation "wherein the regular 
expression with the database-structured query further comprises at least one pattern 
used to determine the data to extract" as (col. 10, lines 2-5; col. 12, lines 5-10, table 2). 

As to claim 21 , Madnick teaches the claimed limitation "an editor for creating a 
template of regular expressions used to extract the data" as (col. 12, lines 5-10, table 2). 

As to claim 22, Madnick teaches the claimed limitation " at least one data 
extraction engine to extract the data" as (col. 15, lines 25-35). 
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As to claim 23, Madnick teaches the claimed limitation "wherein the data 
extraction engine is a web crawler" as the wrapper generator 614 (col. 15, lines 25-35). 

As to claim 24, Madnick teaches the claimed limitation " wherein the web 
domain address further comprises at least one link address for locating at least a 
portion of the data" as (col. 9, lines 55-67; col. 10, lines 1-5). 

As to claim 26, Madnick teaches the claimed limitation "wherein the web domain 
address further comprises a link address, wherein at least another portion of the data is 
located with the link address" as (col. 9, lines 55-67; col. 10, lines 1-5). 

As to claim 34, Madnick teaches the claimed limitation: 

"generating a database structured query with at least one fundamental clause 
based, in part, on user input" as the request translator receives a data request from data 
receiver 102 and translates the data request into a query at the wrapper generator 614. 
The converter query converts a least a portion of the query into a command to interact 
with a semi-structured data sources such as HTML documents, flat files containing data 
that are not arranged as a relational database. The above information shows that a 
command is created based on the data request. The data request from data receiver 
102 is represented as a user input. The command is represented as the database- 
structured query. The wrapper generator 614 is represented as a server (col. 2, lines 
33-55, col. 8, lines 40-60); 
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"determining at least one webpage with the data, wherein the determination of 
the webpage is provided by the database-structured query" as extracting web pages 
that contains data by the commands (col. 9, lines 55-67; col. 10, lines 1-5); 

"parsing the data at the at least one webpage in search of data that satisfies a 
query condition" as (col. 15, lines 1-10; table 2, col. 12, lines 1-20); 

"wherein the data at the at least one web page is directly processed as though it 
is a searchable database" as the data receives 620 receives the web pages and 
extracts the requested data from those pages. The above information shows that each 
web page or website is a searchable database (col. 10, lines 1-5); 

"whereby a non-database structured arrangement of data is retrieved at the least 
one webpage and the database-structured query is performed upon the retrieved non- 
database structured arrangement of data" as each command is performed upon flat files 
containing data that are not arranged as a relational database at the website or web 
page (col. 2, lines 27-32; col. 9, lines 55-67; col. 10, lines 1-5); 

"extracting at least a portion of the data from the retrieved non-database 
structured arrangement of data that satisfies the query condition" as extracting data at 
a web page that satisfies the query condition (col. 15, lines 1-20); 

Madnick does not explicitly teach the claimed limitation " reshaping the extracted 
data to a predetermined format; providing extracted data in data log directly to the user". 

Bates teaches storing extracted documents as results in result cache and 
returning a first document as a first result from the result cache directly to a user (fig. 10, 
col. 11, lines 60-67; col. 12, lines 1-27). 
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lizuka teaches outputting the search result in a prescribed single format that is 
specific to each user. In particularly, converting the search result into the item 
presentation styles of each user according to the style conversion data (col. 5, lines 5- 
10; col. 5, lines 35-40). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply lizuka's teaching of outputting the search result in a 
prescribed single format that is specific to each user. In particularly, converting the 
search result into the item presentation styles of each user according to the style 
conversion data lizuka's teaching of the apparatus returns directly the search result to a 
user via a user interface unit 1 1 to Madnick's system in order to provide a result to a 
user quickly after retrieving data from a plurality of semi-structured document via open 
network without need a lot of time and labor to design and manage, to retrieve data 
contained in a plurality of semi-structured documents over open network quickly, 
eliminate network traffic when server receives multiple user's request from at the same 
time and to provide a good view of a search result to a user's system for viewing easily. 

As to claim 37, Madnick teaches the claimed limitation "wherein the structured 
query is generated to parse a limited portion of the data of the at least one webpage 
with the limits predetermined by the user" as (col. 12, lines 1-10, table 2). 
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As to claim 38, Madnick teaches the claimed limitation "wherein structured query 
. is generated to search for at least one of a text string, a table, and a predefined list of 
words" as (col. 2, lines 30-55). 

5. Claims 4, 35 and 36 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Madnick et al (or hereinafter "Madnick") (US 5913214) in view of of lizuka et al (or 
hereinafter "lizuka") (US 6424980) and Bates et al (or hereinafter "Bates") (US 
6873982) and further in view of Hennings et al (or hereinafter "Hennings") (US 
6763496). 

As to claim 4, Madnick does not explicitly teach the claimed limitation "following 
links contained within the web domain until the links have been exhausted or following 
the links until a predetermined limit is reached". Hennings teaches following the links 
until the Caribbean.htm is reached. Caribbean.html is represented as a predetermined 
limit (fig. 8). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply Hennings 1 s teaching of following the links until the 
Caribbean.htm is reached to Madnick' s system in order to response to a customer's 
request for more detailed information about a document on a web page. 

As to claim 35, Madnick does not explicitly teach the claimed limitation "wherein 
the search of data is performed on at least a second webpage". Hennings teaches at 
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least one link:http://www.traveltickets.com to http://www.traveltickets.com/cnjises for 
locating Caribbean data to extract (fig. 8). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply Hennings' s teaching of at least one 
link:http://www.traveltickets.com to http://www.traveltickets.com/cruises for locating 
Caribbean data to extract to Madnick's system in order to response to a customer's 
request for more detailed information about a document on a web page. 

As to claim 36, Madnick does not explicitly teach the claimed limitation "wherein 
parsing the data of the at least one webpage further comprises following links included 
on the webpage and further parsing the data of webpages determined by the links 
included on the webpage". Hennings teaches a first web page comprises links and 
parsing data as shown in fig. 1B to determine links included on the web page (fig. 8). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply Hennings 1 s teaching of a first web page comprises 
links and parsing data as shown in fig. 1B to determine links included on the web page 
to Madnick's system in order to response to a customer's request for more detailed 
information about a document on a web page. 

6. Claim 7 is rejected under 35 U.S.C. 103(a) as being unpatentable over Madnick et 
al (or hereinafter "Madnick") (US 5913214) in view of of lizuka et al (or hereinafter 
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"lizuka") (US 6424980) and Bates et al (or hereinafter "Bates") (US 6873982) and 
further in view of Jammes. 

As to claim 7, Madnick does not explicitly teach the claimed limitation "wherein 
creating the database structured query, further comprises, creating a condition 
expression with the database structured query describing how to scan the data at the 
determined web domain address for the data to extract". Jammes teaches as the 
following is one example of a name/value pair representing a query generated by the 
lnitial_Event_Handler to extract product data related to the root level group: query 
=select Product_name, ProductJD From Relationships, Groups where ID_type = G and 
ID=1000 and relationship = Contains And (col. 22, lines 15-20). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the Invention was made to apply Jammes's teaching of the following is one example of 
a name/value pair representing a query generated by the lhitial_Event_Handler to 
extract product data related to the root level group: query =select Product_name, 
ProductJD From Relationships, Groups where ID_type = G and ID=1000 and 
relationship = Contains to Madnick's system in order to retrieve data in different type of 
data structures corresponding to a user's request. 

7. Claims 8-9 are rejected under 35 U.S.C. 103(a) as being unpatentable over Madnick 
et al (or hereinafter "Madnick") (US 5913214) in view of lizuka et al (or hereinafter 
"lizuka") (US 6424980) and Bates et al (or hereinafter "Bates") (US 6873982) and 
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further in view of Jammes and Christianson et al (or hereinafter "Christianson") (US 
6085186). 

As to claim 8, Madnick discloses the claimed limitation subject matter in claim 1 , 
except the claimed limitation "wherein directly extracting the data from the determined 
web domain, further comprises: retrieving data from the determined web domain 
address; reducing the retrieved data to a region of interest; and searching the region of 
interest for the data matching a predetermined regular expression". 

Jammes teaches the claimed limitation "reducing the retrieved content to a 
region of interest" as an HTML coded result set: web/sedans.html>Sedans </A. This 
information shows the system reduced the retrieved content to a region of interest as 
Sedans (col. 22, lines 22-45). 

Christianson teaches "searching the region of interest for the data matching a 
predetermined regular expression" as matching the returned html text with regular 
expression (col. 20, lines 65-67). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply Jammes's teaching of reducing the retrieved content to 
a region of interest as Sedans and Christianson teaching of matching the returned html 
text with regular expression to Madnick's system in order to Madnick's system in order 
to retrieve data in different type of data structures corresponding to a user's request and 
to determine which sources are relevant to a given query, forwarding the query to the 
most relevant information sources, and further to provide regular expression component 
for creating modular hierarchical descriptions of regular expressions, for binding 
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variables to the correct sub-strings recognized during pattern match to a response of an 
information source, for performing arbitrary action language statements with multiple 
variable bindings. 

As to claim 9, Madnick discloses the claimed limitation subject matter in claim 1 , 
except the claimed limitation "wherein directly extracting the data from the web domain, 
further comprises, storing the data matching the predetermined regular expression". 
Jammes teaches retrieving data records whose status fields match a predetermined 
status value and that a corresponding result set would be generated. This information 
shows that the system stores matched records (col. 26, lines 25-50). 
It would have been obvious to a person of an ordinary skill in the art at the time the 
invention was made to apply Jammes's teaching of retrieving data records whose status 
fields match a predetermined status value and that a corresponding result set would be 
generated to Madnick's system in order to backup a system when the system is 
corrupted. 

8. Claims 11-13, 15-16, 27-28, 30-33 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Madnick et al (or hereinafter "Madnick") (US 5913214) in view of 
Bates et al (or hereinafter "Bates") (US 6873982). 

As to claim 1 1 , Madnick teaches a computer-readable medium having computer- 
executable instructions for extracting data from a network (a memory having one or 
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more commands to issue to the web page in order to retrieve the data from a network, 
col. 3, lines 21-26), "the computer-executable instruction enabling actions" (commands 
are enable for accessing the data and retrieving the data. Accessing and retrieving are 
represented as actions (col. 3, lines 20-26) comprises: 

creating a database-structured query with at least one fundamental 
clauseincluding a web domain address used for locating data, based, in part, on a user 
input" as the request translator receives a data request from data receiver 102 and 
translates the data request into a query. The converter query converts a least a portion 
of the query into a command to interact with a semi-structured data sources such as 
HTML documents, flat files containing data that are not arranged as a relational 
database. The above information shows that a command is created based on the data 
request. The data request from data receiver 102 is represented as a user input. The 
command is represented as the database-structured query (fig. 7; col. 2, lines 30-55; 
col. 8, lines 40-60); 

"locating data based on the web domain address provided by the database- 
structured query" as the descriptor file 702 may be a directory of URL addresses which 
locate necessary information about the data source 104. The above information shows 
that the data source is located based on the URL addresses. The URL address is 
represented as the web domain address (col. 10, lines 27-30), 

"extracting at least a portion of the located data directly by retrieving a non 
database structured arrangement of data from the located data and performing the 
database-structured query upon the retrieved non-database structured arrangement of 
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data" as at least a portion of the query is converted into one or more commands which 
can be used to interact with a semi-structured data source. Those commands are 
issued and data is extracted from the data source. In this case a source is located at 
an address or URL. The above information shows that the data is extracted from a 
semi-structured data source based on the address of the source and the command (fig. 
7, col. 10, lines 25-32; col. 2, lines 2-8). 

Madnick does not explicitly teach the claimed limitation " providing extracted data 
in data log directly to the user". 

Bates teaches storing extracted documents as results in result cache and 
returning a first document as a first result from the result cache directly to a user (fig. 10, 
col. 11, lines 60-67; col. 12, lines 1-27). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply Bates teaches storing extracted documents as results 
in result cache and returning a first document as a first result from the result cache 
directly to a user to Madnick's system in order to provide a result to a user quickly after 
retrieving data from a plurality of semi-structured document via open network without 
need a lot of time and labor to design and manage. 

As to claim 12, Madnick teaches the claimed limitation "wherein the database- 
structured query, further comprises, a network address included within the database- 
structured query, further comprises, a network address included within the database- 
structured query indicating a starting point" as creating a command after converting at 
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least a portion of a query, the command includes a network address as URL: 
http://quotes.qalt.com/ . Quotes.galt is indicated as a starting point (Table 2, col. 7, lines 
25-32; col. 2, lines 5-10). 

As to claim 13, "wherein the network address, further comprises at least one 
universal resource locator (URL)" as URL (col. 12, table 2). 

As to claim 15, "wherein the database-structured query, further comprises, a 
regular expression within the database-structured query used to determine the data to 
extract" as a regular expression with the file 706 as the database-structured query (col. 
12, table 2). 

As to claim 16, "wherein the regular expression within the database-structured 
query, further comprises at least one pattern, used to determine the data to extract" as 
each variable to be retrieved in a given state, the state description contains a pattern to 
be matched against the document or semi-structured data source. The above 
information shows that matching each pattern of each variable contained with the 
regular expression (col. 15, lines 1-10). 

As to claim 27, Madnick teaches the claimed limitations: 

"creating a database-structured query with at least one fundamental clause at the 
server based, in part, on a user input" as the request translator receives a data request 
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from data receiver 102 and translates the data request into a query at the wrapper 
generator 614. The converter query converts a least a portion of the query into a 
command to interact with a semi-structured data sources such as HTML documents, flat 
files containing data that are not arranged as a relational database. The above 
information shows that a command is created based on the data request. The data 
request from data receiver 102 is represented as a user input. The command is 
represented as the database-structured query. The wrapper generator 614 is 
represented as a server (fig. 7, col. 2, lines 30-55; col. 8, lines 40-60); 

"determining a website to search based in part on the database-structured query" 
as determining a URL on the network to extract the data implies determines a website 
(table 2, col. 12, lines 1-10, lines 1-5); 

"extracting at least a portion of the data relevant to the database-structured query 
at the website directly based on the database-structured query" as extracting the 
requested web pages to the wrapper generator 614 in response to the transmitted 
commands (col. 9, lines 55-67; col. 10, lines 1-5); 

"wherein the website is processed as a searchable database" as the data 
receives 620 receives the web pages and extracts the requested data from those 
pages. The above information shows that each web page or website is a searchable 
database (col. 10, lines 1-5); 

"whereby a non-database arrangement of data is retrieved from the website and 
the database-structured query is performed upon at least the retrieved non-database 
arrangement of the data " as each command is performed upon flat files containing data 



Application/Control Number: 09/833,846 Page 21 

Art Unit: 2162 

that are not arranged as a relational database at the website or web page (col. 2, lines 
27-32; col. 9, lines 55-67; col. 10, lines 1-5). 

"to extract at least the portion of the data from the retrieved non-database 
arrangement of the data" as extracting the data from the HTML documents that the non- 
database arrangement of the data (col. 2, lines 27-32; col. 10, lines 1-5). 

Madnick does not explicitly teach the claimed limitation "providing extracted data 
in data log directly to the user". 

Bates teaches storing extracted documents as results in result cache and 
returning a first document as a first result from the result cache directly to a user (fig. 10, 
col. 11, lines 60-67; col. 12, lines 1-27). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply Bates teaches storing extracted documents as results 
in result cache and returning a first document as a first result from the result cache 
directly to a user to Madnick's system in order to provide a result to a user quickly after 
retrieving data from a plurality of semi-structured document via open network without 
need a lot of time and labor to design and manage. 

As to claim 28, Madnick teaches the claimed limitation "parsing the database- 
structure query to determine at least one link to search at the website" as (col. 12, lines 
1-20, table 2). 
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As to claim 30, Madnick teaches the claimed limitation "determining what data to 
extract based in part on the database-structured query and the provided web domain 
address" as (col. 12, lines 1-20, table 2). 

As to claim 31 , Madnick and lizuka teach the claimed limitation subject matter in 
claim 27, lizuka further teaches the claimed limitation "wherein extracting data based in 
part on at least one of an Hypertext Markup Language (HTML) table, a binary file, and a 
matching pattern" as extracting data based on an HTML table (col. 14, lines 34-40). 

As to claim 32, Madnick teaches the claimed limitation "reshaping the extracted 
data for at least one of a database, a spreadsheet, Extensible Markup Language (XML) 
display, and a statistical tool" as (col. 3, lines 1-8). 

As to claim 33, Madnick teaches the claimed limitation "wherein the website is a 
starting website based in part on the database-structured query" as (col. 10, lines 1-5). 

9. Claims 14, 25 and 29 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Madnick et al (or hereinafter "Madnick") (US 5913214) in view of Bates and further 
in view of Hennings et al (or hereinafter "Hennings") (US 6763496). 

As to claim 14, Madnick does not explicitly teach the claimed limitation 

"at least one link to another web domain address for locating data to extract". 
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Hennings teaches at least one link:http://www.traveltickets.com to 
http://www.traveltickets.com/cruises for locating Caribbean data to extract (fig. 8). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply Hennings' s teaching of at least one 
link:http://www.traveltickets.com to http://www.traveltickets.com/cruises for locating 
Caribbean data to extract to Madnick's system in order to response to a customer's 
request for more detailed information about a document on a web page. 

As to claim 25, Madnick does not explicitly teach the claimed limitation "at least 
one link address that is followed to locate data to extract until a predetermined number 
of links is reached". Hennings teaches following the links until the.Caribbean.htm is 
reached. Caribbean.html is represented as a predetermined limit (fig. 8). 

It would have been obvious to a person of an ordinary skill in the art at the time 
the invention was made to apply Hennings' s teaching of following the links until the 
Caribbean.htm is reached to Madnick's system in order to response to a customer's 
request for more detailed information about a document on a web page. 

As to claim 29, Madnick teaches the claimed limitation "determining at least one 
other website to search based in part on the database-structured query and a provided 
web domain address" as (col. 9, lines 55-67; col. 10, lines 1-5). 

Madnick does not explicitly teach the claimed limitation "extracting at least 
another portion of the data at the at least one other website based on the database- 
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structured query and the provided web domain address, wherein the at least one other 
website include a non-database structured arrangement of data that is processed as a 
searchable database". Hennings teaches extracting Golfing data at a second web 
page. This web page includes a HTML document as a non-database structured 
arrangement of data (fig. 8). 

It would have been obvious to a person of an ordinary in the art at the time the 
invention was made to apply Hennings's teaching of extracting Golfing data at a second 
web page to Madnick's system in order to retrieve data contained in a plurality of semi- 
structured documents over a network. 
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Conclusion 

10. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 
§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 
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